Word recognition through mapping of lip movements from speech utterance using audiovisual fusion and MLP
نویسندگان
چکیده
Speech has information more than text, but under noisy environment speech sufferance from disadvantage of not properly decoded by humans and same is true with machines. being bimodal along audio features if we augment visual specifically related to lip movements. the degree recognition can be improved. The objective this work use aid word recognition. In extracted MFCC for Geometrical movements together used in machine learning algorithm predict utterances. Videos utterances are TIMID database. With statistical corresponding form input feature vector (Multi-layer perceptron). experimental results show that using MLP have obtained a accuracy 91% KNN Classifier attained 61%. presented here important implications applications HMI communication helps hearing impaired.
منابع مشابه
Lip movements affect infants' audiovisual speech perception.
Speech is robustly audiovisual from early in infancy. Here we show that audiovisual speech perception in 4.5-month-old infants is influenced by sensorimotor information related to the lip movements they make while chewing or sucking. Experiment 1 consisted of a classic audiovisual matching procedure, in which two simultaneously displayed talking faces (visual [i] and [u]) were presented with a ...
متن کاملThai Word Recognition Using Hybrid MLP-HMM
The Hidden Markov Model (HMM) is a popular model for speech recognition systems. However, one of the difficulties in applying HMM is the estimation of the emission probabilities for constructing the Gaussian Mixture Models (GMMs). In this paper, we propose a method to estimate the state emission probabilities in HMM framework using Artificial Neural Networks (ANNs), particularly the Multi-Layer...
متن کاملWhole-Word Recognition from Articulatory Movements for Silent Speech Interfaces
Articulation-based silent speech interfaces convert silently produced speech movements into audible words. These systems are still in their experimental stages, but have significant potential for facilitating oral communication in persons with laryngectomy or speech impairments. In this paper, we report the result of a novel, real-time algorithm that recognizes wholewords based on articulatory ...
متن کاملthe effect of vocabulary instruction through semantic mapping on learning and recall of efl learners
چکیده ندارد.
15 صفحه اولFuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition
In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Health Sciences (IJHS)
سال: 2022
ISSN: ['2550-6978', '2550-696X']
DOI: https://doi.org/10.53730/ijhs.v6ns2.6078